Search results for "genomic data"
showing 10 items of 23 documents
Comparative Mitogenomics of Leeches (Annelida: Clitellata): Genome Conservation and Placobdella-Specific trnD Gene Duplication.
2015
Mitochondrial DNA sequences, often in combination with nuclear markers and morphological data, are frequently used to unravel the phylogenetic relationships, population dynamics and biogeographic histories of a plethora of organisms. The information provided by examining complete mitochondrial genomes also enables investigation of other evolutionary events such as gene rearrangements, gene duplication and gene loss. Despite efforts to generate information to represent most of the currently recognized groups, some taxa are underrepresented in mitochondrial genomic databases. One such group is leeches (Annelida: Hirudinea: Clitellata). Herein, we expand our knowledge concerning leech mitochon…
Glomeromycotina: what is a species and why should we care?
2018
A workshop at the recent International Conference on Mycorrhiza was focused on species recognition in Glomeromycotina and parts of their basic biology that define species. The workshop was motivated by the paradigm-shifting evidence derived from genomic data for sex and for the lack of heterokaryosis, and by published exchanges in Science that were based on different species concepts and have led to differing views of dispersal and endemism in these fungi. Although a lively discussion ensued, there was general agreement that species recognition in the group is in need of more attention, and that many basic assumptions about the biology of these important fungi includ…
Reactome graph database: Efficient access to complex pathway data
2018
Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…
MiasDB: A Database of Molecular Interactions Associated with Alternative Splicing of Human Pre-mRNAs.
2016
Alternative splicing (AS) is pervasive in human multi-exon genes and is a major contributor to expansion of the transcriptome and proteome diversity. The accurate recognition of alternative splice sites is regulated by information contained in networks of protein-protein and protein-RNA interactions. However, the mechanisms leading to splice site selection are not fully understood. Although numerous databases have been built to describe AS, molecular interaction databases associated with AS have only recently emerged. In this study, we present a new database, MiasDB, that provides a description of molecular interactions associated with human AS events. This database covers 938 interactions …
Identification of factors involved in dimorphism and pathogenicity of Zymoseptoria tritici
2017
A forward genetics approach was applied in order to investigate the molecular basis of morphological transition in the wheat pathogenic fungus Zymoseptoria tritici. Z. tritici is a dimorphic plant pathogen displaying environmentally regulated morphogenetic transition between yeast-like and hyphal growth. Considering the infection mode of Z. tritici, the switch to hyphal growth is essential for pathogenicity, allowing the fungus to invade the host through natural openings like stomata. We exploited a previously developed Agrobacterium tumefaciens-mediated transformation (ATMT) to generate a mutant library by insertional mutagenesis including more than 10,000 random mutants. To identify gene…
Functional comparison of bacteria from the human gut and closely related non-gut bacteria reveals the importance of conjugation and a paucity of moti…
2016
The human GI tract is a complex and still poorly understood environment, inhabited by one of the densest microbial communities on earth. The gut microbiota is shaped by millennia of evolution to co-exist with the host in commensal or symbiotic relationships. Members of the gut microbiota perform specific molecular functions important in the human gut environment. This can be illustrated by the presence of a highly expanded repertoire of proteins involved in carbohydrate metabolism, in phase with the large diversity of polysaccharides originating from the diet or from the host itself that can be encountered in this environment. In order to identify other bacterial fun…
Big Data in metagenomics: Apache Spark vs MPI.
2020
The progress of next-generation sequencing has led to the availability of massive data sets used by a wide range of applications in biology and medicine. This has sparked significant interest in using modern Big Data technologies to process this large amount of information in distributed memory clusters of commodity hardware. Several approaches based on solutions such as Apache Hadoop or Apache Spark have been proposed. These solutions allow developers to focus on the problem while the need to deal with low level details, such as data distribution schemes or communication patterns among processing nodes, can be ignored. However, performance and scalability are also of high importance when…
A summary of genomic databases: overview and discussion
2009
In the last few years both the amount of electronically stored biological data and the number of biological data repositories have grown significantly (today, more than eight hundred repositories can be counted). In spite of the enormous amount of available resources, a user may be disoriented when searching for specific data. Thus, an accurate analysis of biological data and repositories turns out to be useful to obtain a systematic view of biological database structures, tools and contents and, eventually, to facilitate the access and recovery of such data. In this chapter, we propose an analysis of genomic databases, which are databases of fundamental importance for the research in bioinfo…
Abstract A22: PanDrugsDB: Identifying druggable genetic dependencies for personalized cancer therapy
2015
The paradigm of personalized medicine is the identification of the appropriate drug for the right patient, using molecular profiles. In Oncology, it is well established that the anticancer drugs are effective in only a small subset of patients. Moreover, many of the new targeted therapies inhibit specific proteins, and they are only effective in tumors that are genetically altered. Consequently, the success of personalized treatment depends on each individual molecular profile, which a priori can be considered as very heterogeneous. Here, we present a new computational approach (PanDrugsDB) based on the analysis and integration of genomic data (mutations, copy number variations or …
Towards next generation diagnostics for tuberculosis: identification of novel molecular targets by large-scale comparative genomics
2019
Tuberculosis remains one of the main causes of death worldwide. The long and cumbersome process of culturing Mycobacterium tuberculosis complex (MTBC) bacteria has encouraged the development of specific molecular tools for detecting the pathogen. Most of these tools aim to become novel tuberculosis diagnostics, and big efforts and resources are invested in their development, looking for the endorsement of the main public health agencies. Surprisingly, no study had been conducted where the vast amount of genomic data available is used to identify the best MTBC diagnostic markers. In this work, we use large-scale comparative genomics to provide a catalog of 30 characterized loci that ar…